07. Text: Metadata

Review the column Metadata

Some of the columns you want to use in your project will have coded values that represent longer more readable values. For instance the cancellation_reason column in the flights data set has the values: A, B, C, D
These letters are not understandable by themselves. You need to replace these letters with the full reason to make your visualizations including this data more readable.

These letters correspond with the following reasons.

A - Airline/Carrier

B - Weather

C - National Air System

D - Security

You should review the Column Metadata tab on Kaggle for each data set to find details about the data like the one I have outlined above.

Flights

link here: https://www.kaggle.com/usdot/flight-delays/data

US Demographic data

link here: https://www.kaggle.com/muonneutrino/us-census-demographic-data/data

  • click the second data link that is the county file which is what we are using for the project

Youtube US data

link here: https://www.kaggle.com/datasnaek/youtube-new/data

  • There is no column metadata for this file, reach out on Study Groups or Knowledge if you are needing assistance interpreting a column
  • The location data was added so that you could make use of the map visualization, treat this as the upload location for each video.

Common Questions and Answers:

https://knowledge.udacity.com